
Collaborating Authors: Mason Carnahan


EEG based Continuous Speech Recognition using Transformers

Krishna, Gautam, Tran, Co, Carnahan, Mason, Tewfik, Ahmed H

arXiv.org Machine Learning

In this paper we investigate continuous speech recognition from electroencephalography (EEG) features using the recently introduced end-to-end transformer based automatic speech recognition (ASR) model. Our results show that the transformer based model demonstrates faster inference and training than recurrent neural network (RNN) based sequence-to-sequence EEG models, but the RNN based models performed better at test time on a limited English vocabulary. Continuous speech recognition using non-invasive brain signals, or electroencephalography (EEG) signals, is an emerging area of research in which EEG signals recorded from the scalp of the subject are translated to text. EEG based continuous speech recognition technology gives people with speaking disabilities, or people who are unable to speak, better access to technology. Current state-of-the-art voice assistant systems mainly process acoustic input features, limiting technology accessibility for people with speaking disabilities or people with no ability to produce voice.
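The transformer based EEG-to-text setup described above can be sketched as a small encoder-decoder over EEG feature frames. This is a hypothetical minimal sketch, not the authors' configuration: the feature dimension, vocabulary size, and layer counts are illustrative assumptions.

```python
# Minimal sketch of a transformer encoder-decoder over EEG feature frames.
# All dimensions here (eeg_dim, d_model, vocab_size, layer counts) are
# illustrative assumptions, not the configuration used in the paper.
import torch
import torch.nn as nn

class EEGTransformerASR(nn.Module):
    def __init__(self, eeg_dim=30, d_model=64, vocab_size=32, nhead=4, num_layers=2):
        super().__init__()
        self.input_proj = nn.Linear(eeg_dim, d_model)   # project EEG frames to model width
        self.token_emb = nn.Embedding(vocab_size, d_model)
        self.transformer = nn.Transformer(
            d_model=d_model, nhead=nhead,
            num_encoder_layers=num_layers, num_decoder_layers=num_layers,
            dim_feedforward=128, batch_first=True)
        self.out = nn.Linear(d_model, vocab_size)       # per-step token logits

    def forward(self, eeg, tokens):
        src = self.input_proj(eeg)                      # (batch, frames, d_model)
        tgt = self.token_emb(tokens)                    # (batch, steps, d_model)
        # Causal mask so each decoder step only attends to earlier tokens
        mask = self.transformer.generate_square_subsequent_mask(tokens.size(1))
        h = self.transformer(src, tgt, tgt_mask=mask)
        return self.out(h)                              # (batch, steps, vocab_size)

model = EEGTransformerASR()
eeg = torch.randn(2, 50, 30)               # 2 utterances, 50 frames, 30 EEG features
tokens = torch.randint(0, 32, (2, 10))     # shifted target token ids (teacher forcing)
logits = model(eeg, tokens)
print(logits.shape)                        # torch.Size([2, 10, 32])
```

Because the transformer processes all frames in parallel rather than step by step, training and inference are faster than with a recurrent encoder, which matches the speed observation reported above.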


Continuous Speech Recognition using EEG and Video

Krishna, Gautam, Carnahan, Mason, Tran, Co, Tewfik, Ahmed H

arXiv.org Machine Learning

In this paper we investigate whether electroencephalography (EEG) features can be used to improve the performance of continuous visual speech recognition systems. We implemented a connectionist temporal classification (CTC) based end-to-end automatic speech recognition (ASR) model for performing recognition. Our results demonstrate that EEG features are helpful in enhancing the performance of continuous visual speech recognition systems. In recent years there has been a lot of interesting work in the fields of lip reading and audio-visual speech recognition. In [1] the authors demonstrated end-to-end sentence-level lip reading, and in [2] the authors demonstrated deep learning based end-to-end audio-visual speech recognition.
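A CTC-based recognizer that fuses EEG and visual features could look like the sketch below, which concatenates the two feature streams frame by frame before a recurrent encoder. The feature sizes and the single-GRU encoder are illustrative assumptions, not the paper's architecture.

```python
# Sketch of a CTC recognizer fusing EEG and video (lip) features per frame.
# Feature dimensions and the single-layer GRU encoder are assumptions made
# for illustration only.
import torch
import torch.nn as nn

class EEGVideoCTC(nn.Module):
    def __init__(self, eeg_dim=30, video_dim=16, hidden=64, vocab_size=28):
        super().__init__()
        self.encoder = nn.GRU(eeg_dim + video_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab_size + 1)    # +1 for the CTC blank

    def forward(self, eeg, video):
        x = torch.cat([eeg, video], dim=-1)             # frame-level feature fusion
        h, _ = self.encoder(x)
        return self.out(h).log_softmax(-1)              # (batch, frames, vocab+1)

model = EEGVideoCTC()
eeg, video = torch.randn(2, 40, 30), torch.randn(2, 40, 16)
log_probs = model(eeg, video).transpose(0, 1)           # CTCLoss expects (T, N, C)
targets = torch.randint(1, 29, (2, 12))                 # label ids; 0 is the blank
loss = nn.CTCLoss(blank=0)(log_probs, targets,
                           torch.full((2,), 40), torch.full((2,), 12))
```

CTC is a natural fit here because the EEG/video frame sequence is much longer than the character sequence and no frame-level alignment is available.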


Improving EEG based Continuous Speech Recognition

Krishna, Gautam, Tran, Co, Carnahan, Mason, Han, Yan, Tewfik, Ahmed H

arXiv.org Machine Learning

Gautam Krishna, Co Tran, Mason Carnahan, Yan Han, and Ahmed H Tewfik are with the Brain Machine Interface Lab, The University of Texas at Austin, Austin, Texas. In this paper we introduce various techniques to improve the performance of electroencephalography (EEG) based continuous speech recognition (CSR) systems. A connectionist temporal classification (CTC) based automatic speech recognition (ASR) system was implemented for performing recognition. We introduce techniques to initialize the weights of the recurrent layers in the encoder of the CTC model with more meaningful weights rather than random weights, and we make use of an external language model to improve beam search during decoding. Finally, we study the problem of predicting articulatory features from EEG features. ASR systems form the front end or back end in many state-of-the-art voice assistant systems such as Bixby, Alexa, Siri, and Cortana.
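The external language model technique mentioned above is commonly realized as shallow fusion: during beam search, each hypothesis score adds a weighted language-model log-probability on top of the acoustic score. The sketch below shows the idea with a toy bigram "LM"; the tokens, weights, and the bigram table are stand-in assumptions, not the model used in the paper.

```python
# Sketch of beam search with shallow LM fusion: hypothesis score =
# acoustic log-prob + lm_weight * LM log-prob. The bigram table below is a
# toy stand-in for a real external language model.
import math

def beam_search(step_log_probs, lm_logprob, beam=3, lm_weight=0.5):
    """step_log_probs: list over time steps of {token: acoustic log-prob}."""
    beams = [((), 0.0)]                       # (token sequence, fused score)
    for dist in step_log_probs:
        candidates = []
        for seq, score in beams:
            for tok, lp in dist.items():
                prev = seq[-1] if seq else "<s>"
                fused = score + lp + lm_weight * lm_logprob(prev, tok)
                candidates.append((seq + (tok,), fused))
        # Keep only the `beam` highest-scoring hypotheses
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam]
    return beams[0][0]

# Toy bigram LM that prefers "hi" after "<s>" and "there" after "hi".
bigram = {("<s>", "hi"): 0.9, ("hi", "there"): 0.9}
lm = lambda p, t: math.log(bigram.get((p, t), 0.1))
steps = [{"hi": math.log(0.6), "hello": math.log(0.4)},
         {"there": math.log(0.5), "their": math.log(0.5)}]
print(beam_search(steps, lm))                 # ('hi', 'there')
```

Here the LM breaks the tie between the equally likely "there" and "their" at the second step, which is exactly the kind of correction an external language model contributes during decoding.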